ABSTRACT

Language understanding work at Paramax focuses on applying
general-purpose language understanding technology to spoken
language understanding, text understanding, and document
processing, integrating language understanding with speech
recognition, knowledge-based information retrieval, and
image understanding.

1.  INTRODUCTION 

The basic language understanding technology in the Paramax
language understanding architecture has been designed to be
independent of both the language input modality (e.g. spoken
input vs. scanned OCR input) and the end application (e.g.
command and control vs. data extraction). This architecture
is shown in Figure 1.

Figure 1: Paramax Language Understanding Inputs and Applications

2.  SPOKEN  LANGUAGE  UNDERSTANDING

The focus of our spoken language understanding work is the
development of a domain- and application-independent dialog
processing architecture which can be coupled with a variety
of speech recognizers. Some of our recent achievements
include:

• a non-monotonic reasoning capability, which will support
  “what if?” exploratory dialogs

• a query paraphrase component to enhance user-friendliness

• enhanced reference resolution capabilities to support
  additional types of context-dependent references

• automated training techniques for semantics

Figure 2 illustrates the integration of a speech recognizer,
a language understanding component, and a dialog manager
(VFE) into a modular spoken language architecture
interacting with the user and the application. A more
detailed description of this work can be found in [1].

Figure 2: Paramax spoken language system architecture
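
The modular arrangement described above can be illustrated with a minimal sketch. It is not the Paramax implementation: the class `DialogManager`, the three callable interfaces, and the toy components are all hypothetical names introduced here, and the sketch assumes only what the text states, namely that a recognizer, a language understanding component, and a dialog manager are separate, replaceable modules sitting between the user and the application.

```python
from typing import Callable, Dict, List

# Hypothetical interfaces; the source does not specify any of these signatures.
Recognizer = Callable[[bytes], str]              # audio -> recognized word string
Understander = Callable[[str, List[str]], Dict]  # utterance + history -> meaning
Application = Callable[[Dict], str]              # meaning -> response for the user


class DialogManager:
    """Coordinates the modules and keeps the dialog history (assumed design)."""

    def __init__(self, recognize: Recognizer, understand: Understander,
                 application: Application) -> None:
        self.recognize = recognize
        self.understand = understand
        self.application = application
        self.history: List[str] = []

    def turn(self, audio: bytes) -> str:
        utterance = self.recognize(audio)
        # Earlier utterances are passed along so that context-dependent
        # references could, in principle, be resolved against them.
        meaning = self.understand(utterance, self.history)
        self.history.append(utterance)
        return self.application(meaning)


# Toy stand-ins so the sketch runs end to end.
def toy_recognizer(audio: bytes) -> str:
    return audio.decode("ascii")


def toy_understander(utterance: str, history: List[str]) -> Dict:
    return {"query": utterance, "turn": len(history)}


def toy_application(meaning: Dict) -> str:
    return f"{meaning['turn']}:{meaning['query']}"
```

Because the dialog manager depends only on the three callables, a different recognizer can be substituted by passing a different function, which is the kind of coupling to "a variety of speech recognizers" the section describes.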

3.  TEXT  UNDERSTANDING 

In text understanding we emphasize an architecture in which
a variety of components cooperate to produce an analysis of
texts. Not all components may be required for any one
application; they can be combined in various ways to suit
each application’s needs. This architecture, shown in
Figure 3, includes keyword-based information retrieval for
message routing. A knowledge-based information retrieval
component (KBIRD) extracts text information which is
accessible without a detailed natural language analysis. If
an application requires detailed natural language
processing, a natural language processing system is
available. Finally, a database record generator formats the
extracted information for database generation. This
approach is described in [2].
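
The idea that components can be combined per application can be sketched as a configurable list of stages. This is only an illustration under stated assumptions: the names `Message`, `keyword_router`, `shallow_extractor`, `record_generator`, and `run_pipeline`, the keyword table, and the "key: value" extraction rule are all hypothetical stand-ins and do not describe how KBIRD or the Paramax components actually work.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Message:
    text: str
    topics: List[str] = field(default_factory=list)     # set by keyword routing
    facts: Dict[str, str] = field(default_factory=dict)  # set by extraction stages


Stage = Callable[[Message], Message]

# Hypothetical routing table: topic -> trigger keywords.
ROUTING_KEYWORDS = {
    "shipping": ["vessel", "port"],
    "aviation": ["aircraft", "runway"],
}


def keyword_router(msg: Message) -> Message:
    """Keyword-based routing: tag the message with every matching topic."""
    words = set(re.findall(r"[a-z]+", msg.text.lower()))
    msg.topics = [t for t, kws in ROUTING_KEYWORDS.items() if words & set(kws)]
    return msg


def shallow_extractor(msg: Message) -> Message:
    """Stand-in for extraction without full parsing: read 'key: value' lines."""
    for line in msg.text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() and value.strip():
            msg.facts[key.strip().lower()] = value.strip()
    return msg


def record_generator(msg: Message) -> Dict[str, str]:
    """Format the extracted information as a flat database-style record."""
    return {"topics": ",".join(msg.topics), **msg.facts}


def run_pipeline(text: str, stages: List[Stage]) -> Dict[str, str]:
    """Run only the stages a given application asks for, then emit a record."""
    msg = Message(text)
    for stage in stages:
        msg = stage(msg)
    return record_generator(msg)
```

A routing-only application would call `run_pipeline(text, [keyword_router])`, while a data extraction application would add `shallow_extractor` (or a full natural language processing stage) to the list.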

Figure 3: Paramax text understanding architecture

Figure 4: Multiple sources of knowledge are integrated in
this document processing architecture.


4.  DOCUMENT  UNDERSTANDING 

The object of Paramax document understanding research is to
develop novel knowledge-based approaches to the processing
of document images. Starting with the scanned image(s) of a
document’s pages, our goal is to determine the document’s
functional and physical organization and to extract the key
ideas from the ASCII representation of its text. Our current
focus is on producing the structural interpretation of the
document and the accompanying ASCII text for each component.
The ASCII text would then be analyzed by a
text-understanding system, as discussed in Section 3.

An intelligent document understanding system is shown in
Figure 4, with the scope of our project falling in the
shaded region.

This  work  is  discussed  in  detail  in  [3]. 